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Summary 


Many viral and cellular mRNA species 
contain a leader sequence derived from a 
distant upstream site on the same gene by 
a process of RNA splicing. This process 
usually involves either nuclear functions 
or self-splicing of RNA molecules. Coro- 
navirus, a cytoplasmic RNA virus, un- 
folds yet another mechanism of joining 
RNA, which involves the use of a free 
leader RNA molecule. This molecule is 
synthesized and dissociates from the 
template RNA, and subsequently reas- 
sociates with the template RNA at down- 
stream initiation sites of subgenomic 
mRNAs to serve as the primer for 
transcription. This leader-primed trans- 
criptional process thus generates viral 
mRNAs with a fused leader sequence. A 
similar mechanism might also operate in 
the mRNA transcription of African 
trypanosomes. 


Introduction 


Coronaviruses are a newly recognized 
group of viruses which infect many 
animal species, causing a variety of 
respiratory and gastrointestinal illnes- 
ses. Among these viruses are porcine 
transmissible gastroenteritis virus 
(TGEV), bovine coronavirus (BCV), 
avian infectious bronchitis virus (IBV), 
feline infectious peritonitis virus 
(FIPV), etc., which cause severe epidem- 
iological problems in livestock and 
other domestic animals. Human corona- 
viruses are responsible for a significant 
number of commoncolds and diarrhoea, 
and have also been implicated in 
multiple sclerosis. Another member of 
the virus group, mouse hepatitis virus 
(MHV), is a frequent contaminant of 
laboratory mouse colonies. 
Coronaviruses share several common 


morphological and structural features. 
They consist of an enveloped virus 
particle with characteristic petal-shaped 
spikes, which give an appearance of a 
crown, or corona, thus giving the name 
for the virus. The viral envelope con- 
tains two glycoproteins, one of which 
forms the spikes and the other the 
matrix proteins. The spikes interact 
with virus receptors on the cell surface, 
thus determining the target-cell speci- 
ficity of the virus. Inside the viral 
envelope is a helical nucleocapsid which 
is composed of a nucleocapsid protein 
N and the RNA genome. The genome is 
a single piece of RNA of more than 
18,000 nucleotides. The RNA is of 
positive sense, meaning that the viral 
genome can be directly translated into 
proteins. 


Replication Pathway of 
Coronaviruses 


Coronaviruses generally infect cells of 
the same animal species of origin in 
tissue culture. Immediately after the 
virus enters infected cells, the viral 
RNA is released from the virus particles 
by a process termed ‘uncoating’. The 
viral RNA is used as messenger for an 
RNA-dependent RNA _ polymerase, 
which transcribes the genomic RNA 
into a full-length negative-stranded 
RNA.’ This negative-stranded RNA 
species then serves as the template for 
the synthesis of seven virus-specific 
mRNAs. Each mRNA is used for the 
synthesis of either a viral structural or 
nonstructural protein. The nonstructur- 
al proteins are probably used to regulate 
the replication and transcription of 
virus. One of these nonstructural pro- 
teins is the RNA-dependent RNA 
polymerase which is needed to syn- 


thesize more mRNAs as well as virion 
genomic RNA. The structural proteins, 
together with the genomic RNA, are 
used to assemble the virus particles. 
These particles, unlike most other 
enveloped viruses, bud into the endo- 
plasmic reticulum, instead of directly 
into the plasma membrane.’ The 
mature virus particles travel through 
the Golgi complex and are eventually 
released into the extracellular space. 
The virus acquires the virus-specific 
glycoproteins, El and E2, during the 
budding and transport process. 


Coronavirus mRNAs Have a 
Unique Nested-set Structure 
with a Common Leader RNA 
Sequence 


Coronavirus mRNA is comprised of a 
genomic-sized RNA and six subgenom- 
ic mRNA species. All of these RNAs 
are associated with polysomes, thus 
representing functional mRNA species 
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Fig. 1. mRNA structure of coronaviruses. The 
structure is based on that of mouse hepatitis virus. 
The open boxes represent the functional parts of 
mRNAs. The solid squares are the leader sequences 
and the wavy lines represent poly A tails. NS 
proteins represent nonstructural proteins which are 
not packaged into virus particles. El, Matrix 
protein; E2, spike protein; N, nucleocapsid 
protein. 
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in infected cells. The structure of these 
sequences is such that they each start 
from the 3’-end of the genomic RNA 
and extend for various distances toward 
the 5’-end, depending on the size of the 
mRNA species (Fig. 1).4 Thus the 
sequence of each subgenomic mRNA is 
completely contained within the 3’ 
portion of the next larger mRNA, ie. a 
nested-set structure. The 5’-portion of 
each mRNA therefore contains a 
unique sequence which does not overlap 
with the smaller mRNAs. This unique 
portion is the functional part of each 
mRNA. For instance, when MRNA 3 
was added to an in vitro translation 
system, such as reticulocyte lysate,’ or 
was injected into frog oocytes,® only the 
E2 protein was synthesized, although 
this mRNA contains several down- 
stream genes. Thus each mRNA is 
functionally monocistronic. This phen- 
omenon is probably a result of the in- 
ability of mammalian cells to utilize 
internal translation initiation codons 
of mRNAs. 

The abundance of different mRNAs 
varies. The small mRNAs are relatively 
more abundant than the larger mRNA 
species,” but this inverse relationship 
between the mRNA size and abundance 
is not universal, Furthermore, the 
genetic map of different coronaviruses 
is not co-linear. For instance, IBV 
contains an additional gene inserted 
between the genes encoding E! and N. 
Thus genetic recombination, insertion 
or deletion must have occurred during 
the evolution of the viruses. 

Another unique feature of corona- 
virus mRNAs is that the 5’-ends of all 
the mRNAs, including genomic RNA, 
contain an identical stretch of 50-70 
nucleotides,®»® termed the leader se- 
quence. These leader nucleotides are not 
repeated in the internal region of the 
genomic RNA. Therefore the leader of 
the subgenomic mRNAs must be de- 
rived from the 5’-end of the genomic 
RNA, analogous to the leader se- 
quences of many eukaryotic mRNAs. 
The presence of the leader sequences on 
the coronavirus mRNAs was first sug- 
gested by an unusual oligonucleotide,!° 
which turns out to be the fusion site 
between the leader RNA and mRNAs. 
The identity of the leader sequences has 
now been confirmed by cDNA cloning 
and sequencing of genomic RNA and 
mRNAs. In the case of MHV, the leader 
sequence is approximately 70 nucleo- 
tides long. 
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Fig. 2. Three possible models of coronavirus 
transcription. 


The Coronaviral leader RNA is 
not Derived by the Conventional 
Pre-mRNA Splicing Mechanism 


Although the presence of a leader 
sequence is a common feature of 
eukaryotic mRNAs, the synthesis of 
coronavirus mRNAs does not appear 
to utilize the conventional RNA splicing 
mechanism, which involves cleavage of 
intervening sequences from a precursor 
RNA. Two pieces of data suggest that 
the leader sequences present at the 
5’-end of coronavirus mRNAs are 
probably derived by a novel mechan- 
ism: (1) coronaviruses replicate exclus- 
ively in the cytoplasm of infected 
cells,!:12, while conventional RNA 
splicing takes place in the nucleus; and 
(2) ultraviolet (UV) irradiation of coro- 
navirus-infected cells resulted in the in- 
hibition of individual mRNA synthesis 
at a rate proportional to the size of the 
mRNA." The UV target size of each 
coronavirus mRNA is approximately 
the same.as its physical size, suggesting 
that each subgenomic mRNA is trans- 
cribed independently. If these mRNAs 
were derived from the cleavage of a 
precursor RNA, as in the case of 
conventional RNA splicing, the UV 
target size for inhibition of the synthesis 
of all the mRNAs would be expected to 
be equal to that of the genomic RNA. 
Thus a novel mechanism for the genera- 
tion of these mRNAs is most likely 
utilized by coronaviruses, 

Several transcriptional models for 
coronavirus mRNAs have been pro- 
posed (Fig. 2).4 (1) The ‘loop-out’ 
model in which the RNA template 
forms a loop in the ‘intron’ region, thus 
bringing the leader RNA in close 
proximity to the initiation sites of 
subgenomic mRNAs. The RNA poly- 


merase can thus jump from the leader 
sequence to the mRNAs in continuous 
transcription. For smaller mRNAs the 
loops will be larger. (2) The ‘leader- 
primed transcription’ model in which 
the leader RNA is transcribed and 
becomes dissociated from the RNA 
template. This ‘free’ leader RNA then 
binds to the template at the initiation 
sites of various mRNAs and serves as 
the primer for transcription. (3) The 
‘post-transcriptional processing’ model 


"* in which the leader RNA and the body 


sequences are transcribed independently 
and then joined together by a trans- 
splicing mechanism _ post-transcrip- 
tionally. A considerable body of data! 
suggests that the first and third models 
are not compatible with the replication 
mechanism of coronavirus RNA. Thus, 
the ‘leader-primed transcription’ model 
is considered to be the most likely 
mechanism utilized for the transcription 
of coronavirus mRNAs. 


Evidence for the ‘ Leader-primed 
Transcription’ Model 


The direct evidence in support of the 
‘leader-primed transcription’ model is 
multi-fold. (1) Several free leader RNA 
species of 50-90 nucleotides have been 
detected in the cytoplasm of MHV- 
infected cells. They are discrete RNA 
species, some of which are dissociated 
from the RNA template. (2) A tem- 
perature-sensitive mutant has been 
isolated which synthesizes only the small 
leader RNA but not mRNAs at the non- 
permissive temperature,'® suggesting 
that the synthesis of the leader RNA 
and the synthesis of the mRNAs are 
discontinuous and require different viral 
proteins. (3) During a mixed infection 
with two different MHVs, the mRNAs 
of each virus contain a population with 
the leader RNA sequences derived from 
the co-infecting virus.‘® This result 
suggests that the leader RNA is a 
separate transcriptional unit and can be 
freely exchanged between the mRNAs 
of two co-infecting viruses. Thus the free 
leader-containing RNA species detected 
in the cytoplasm of MHV-infected cells 
represent bona fide transcriptional in- 
termediates, rather than abortive trans- 
cription products. (4) An RNA species 
complementary to the leader RNA 
(‘anti-sense’ leader RNA) could be 
expressed in L-2 cells by a mammalian 
expression vector. When the cells ex- 
pressing this anti-sense leader RNA 
were infected with MHV, the RNA 
transcription of the superinfecting virus 
was inhibited, though not completely 
(L. Soe, unpublished observation). This 


result suggests that the availability of 
the leader RNA sequences is required 
for MHV RNA synthesis. These data 
are most compatible with the interpre- 
tation that the leader RNA serves as a 
primer for transcription. 


The Mechanism of 
Leader-primed Transcription 


The precise mechanism of leader RNA- 
primed transcription has been suggested 
from the sequence analysis of both the 
S’-end and the mRNA initiation sites on 
the RNA genomes. The sequence of the 
5’-end of MHV genomic RNA reveals a 
possible leader RNA _ termination 
signal, i.e. a hairpin loop followed by an 
AU-rich sequence, approximately 84 
nucleotides from the 5’-end (Fig. 3).!” 
This is consistent with the size of the 
free leader-containing RNA _ species 
detected in the infected cells.15 Close to 
the 3’-end of the putative leader RNA, 
there is a stretch of 9-18 nucleotides, 
which is homologous to the genomic 
sequences at the initiation points of 
various mRNAs.!7:!8 Thus the leader 
RNA would be able to bind to the RNA 
template at these sites. The homologous 
sequences are not located at the exact 
3’-end of the free leader RNA, and 
there is also a mismatched nucleotide 
within some of the homologous regions. 
These mismatched nucleotides in the 
leader RNA have to be removed before 
transcription takes place. Thus the 
mechanism of MHV RNA transcription 
involves the synthesis of a free leader 
RNA species from the 3’-end of the 
negative-sense RNA template, followed 
by its binding to the intergenic regions 
via the homologous sequences. The 
leader RNA is then cleaved at 
the mismatched point, generating the 
primer for the initiation of mRNA 
transcription (Fig. 3). Therefore, the 
leader—body fusion sites vary with the 
mRNA species, depending on the loca- 
tion of the mismatched nucleotide. 

There is a close parallel between the 
number of homologous nucleotides at 
the intergenic sites preceding each 
mRNA and the amount of RNA 
synthesized in the infected cells.!” For 
instance, there are 18, 14 and 9 
homologous nucleotides upstream of 
mRNAs 7, 6 and 4, respectively. 
Correspondingly, the molar ratio of 
these three RNAs in infected cells is 
roughly 100:31:1-7.7 In addition, other 
regulatory sequences may also be in- 
volved. The free leader RNA may be 
associated with RNA _ polymerase, 
which could assist in leader RNA 
binding. 
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Fig. 3. The postulated model of leader-primed transcription. The solid squares represent free leader RNA, 
which binds to the initiation points of mRNAs on the template RNA. The expanded regions show the 
nucleotide sequences of the intergenic regions of mRNAs 6 and 7, and the mechanism of leader RNA 
binding. The solid arrows represent the postulated cleavage points of the leader RNA. Transcription starts 
from the 3'-end of the cleaved primer. The resulting mRNAs consist of the leader sequence fused to the 


gene. 


RNA Replication May be 
Discontinuous and 
‘Stop-and-go’ 


After sufficient mRNAs are synthesized 
in the infected cells, RNA synthesis 
shifts to the replication of genomic 
RNAs to be packaged into virions. The 
transcription of mRNAs and replica- 
tion of genomic RNA appear to be 
catalyzed by separate RNA polymer- 
ases. Although the replication of the 
genomic RNA represents faithful copy- 
ing of the RNA template, it probably 
follows a discontinuous and ‘stop-and- 
go’ mode. This mechanism was sug- 
gested by the discovery of many discrete 
species of small leader-containing 
RNAs in the MHV-infected cells.1> The 
sizes of these RNA species correspond 
to the lengths between the 5’-end and 
the probable hairpin loops on the 
genomic RNA. Thus they may represent 
transcriptional ‘pausing’ intermediates 
which stop at the regions of secondary 
structure on the RNA template. These 
intermediates would subsequently par- 
ticipate in continuing transcription. This 
mode of discontinuous RNA transcrip- 
tion has also been demonstrated in 
several other systems, such as QB, MS2 
and T7 phages.?® 2° 

' The evidence that these pausing 
RNA intermediates are bona fide inter- 
mediates of RNA replication came 
from the demonstration that coronavi- 
ruses can undergo RNA recombination 
at a very high frequency,”! which almost 
matches the frequency of reassortment 
of RNA segments in reoviruses or 
influenza virus. This result suggests 
that, although coronavirus contains a 


non-segmented RNA genome, its repli- 
cation may involve the generation of 
freely dissociated RNA intermediates, 
which result from transcriptional ‘ paus- 
ing’ and that these intermediates may 
switch RNA templates. Thus this ‘stop- 
and-go’ mode of RNA replication may 
result in RNA recombination, which 
has now been demonstrated in several 
animal and plant virus systems. 


The Discontinuous, 
Leader-primed RNA 
Transcription May be a Common 
Phenomenon for Gene 
Regulation in Eukaryotic Cells 


The unique feature of leader RNA- 
primed transcription may not be limited 
to coronaviruses. A similar phenom- 
enon has been noted in African 
trypanosomes, in which a 35-nucleotide 
leader RNA sequence is associated with 
Variant Surface Glycoprotein (VSG) 
mRNAs and many other cellular 
mRNA species.??:?3 This leader RNA 
sequence is located on a_ separate 
chromosome, and is expressed as a 
137-nucleotide leader-containing RNA 
species. The mechanism of mRNA 
synthesis in this case is not clear, but 
may involve either a leader-primed 
transcription or a trans-splicing event. 

An RNA-primed transcription has 
also been noted in influenza virus, 
which requires capped (m’ GpppXm) 
cellular mRNAs as primers for its 
mRNA synthesis.?* One notable differ- 
ence between coronavirus and influenza 
virus is the lack of a specific sequence 
requirement for the primer RNA in the 
case of influenza virus. A_ similar 
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primer-dependent RNA. transcription 
has also been detected in yet another 
RNA virus, bunyavirus.”> In addition, 
retroviruses use specific tRNAs to 
prime cDNA reverse transcription. 
The leader RNA in coronavirus 
transcription may play a role in regula- 
ting the rate of RNA synthesis. This can 
be achieved by the extent of homology 
at the intergenic regions and also by the 


specific sequences in the leader RNA.. 


The leader sequences in different MHVs 
appear to vary significantly in their 
ability to promote RNA transcription 
and recombination. Further efforts in 
delineating the leader sequences and 
also the properties of the virus-encoded 
RNA polymerases should give further 
insights into the novel mechanisms of 
leader-primed transcription and discon- 
tinuous RNA replication. 
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